Cue Phrase Classi cation Using Machine Learning
نویسنده
چکیده
Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. Correctly classifying cue phrases as discourse or sentential is critical in natural language processing systems that exploit discourse structure, e.g., for performing tasks such as anaphora resolution and plan recognition. This paper explores the use of machine learning for classifying cue phrases as discourse or sentential. Two machine learning programs (cgrendel and C4.5) are used to induce classi cation models from sets of pre-classi ed cue phrases and their features in text and speech. Machine learning is shown to be an e ective technique for not only automating the generation of classi cation models, but also for improving upon previous results. When compared to manually derived classi cation models already in the literature, the learned models often perform with higher accuracy and contain new linguistic insights into the data. In addition, the ability to automatically construct classication models makes it easier to comparatively analyze the utility of alternative feature representations of the data. Finally, the ease of retraining makes the learning approach more scalable and exible than manual methods.
منابع مشابه
Cue Phrase Classi cation Using Machine
Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. Correctly classifying cue phrases as discourse or sentential is critical in natural language processing systems that exploit discourse structure, e.g., for performing tasks such as anaphora resolution and plan recognition. T...
متن کاملCue Phrase Selection In Instruction Dialogue Using Machine Learning
The purpose of this paper is to identify e ective factors for selecting discourse organization cue phrases in instruction dialogue that signal changes in discourse structure such as topic shifts and attentional state changes. By using a machine learning technique, a variety of features concerning discourse structure, task structure, and dialogue context are examined in terms of their e ectivene...
متن کاملModeling Classi cation and Inference Learning
Human categorization research is dominated by work in classi cation learning. The eld may be in danger of equating the classi cation learning paradigm with the more general phenomenon of category learning. This paper compares classi cation and inference learning and nds that different patterns of behavior emerge depending on which learning mode is engaged. Inference learning tends to focus subj...
متن کاملThe Error Coding Method and PICTs
A new family of plug-in classi cation techniques has recently been developed in the statistics and machine learning literature. A plug-in classi cation technique (PICT) is a method that takes a standard classi er (such as LDA or TREES) and plugs it into an algorithm to produce a new classi er. The standard classi er is known as the Base Classi er. These methods often produce large improvements ...
متن کاملTransductive Inference for Text Classi cation using Support Vector Machines
This paper introduces Transductive Support Vector Machines (TSVMs) for text classi cation. While regular Support Vector Machines (SVMs) try to induce a general decision function for a learning task, Transductive Support Vector Machines take into account a particular test set and try to minimize misclassi cations of just those particular examples. The paper presents an analysis of why TSVMs are ...
متن کامل